Strategies for reducing the complexity of a RNN based speech recognizer
نویسندگان
چکیده
Recurrent Neural Networks (RNN) provide a solution for low cost Speech Recognition Systems (SRS) in mass products or in products with energetic constraints if their inherent parallelism could be exploited in a hardware realization. Actually, the computational complexity of SRS based on Fully Recurrent Neural Networks (FRNN), e.g. the large number of connections, prevents a hardware realization. Here we introduce Locally Recurrent Neural Networks (LRNN) in order to keep the properties of RNN on the one hand and to reduce the connectivity density of the network on the other hand. By simulation experiments it is shown that the recognition capability of LRNN is equivelant to that of FRNN and superior to other proposed network archi-tectures. Furthermore, it is shown that with an appropriate representation of the network paramaters and a retraining of the network 5 Bit quantization of the weights and activities is possipble without signiicant loss in recognition performance.
منابع مشابه
An RNN-based preclassification method for fast continuous Mandarin speech recognition
A novel recurrent neural network-based (RNN-based) frontend preclassification scheme for fast continuous Mandarin speech recognition is proposed in this paper. First, an RNN is employed to discriminate each input frame for the three broad classes of initial, final, and silence. A finite state machine (FSM) is then used to classify the input frame into four states including three stable states o...
متن کاملAn RNN-Based Pre-classi cation Method for Fast Continuous Mandarin Speech Recognition
A novel RNN-based front-end pre-classiication scheme for fast continuous Mandarin speech recognition is proposed in this paper. First, an RNN is employed to discriminate each input frame for the three broad classes of initial, nal, and silence. A nite state machine (FSM) is then used to classify the input frame into four states including three stable states of Initial (I), Final (F), and Silenc...
متن کاملA Speech Recognizer with Low Complexity Based on Rnn
Speech recognition systems (SRS) designed for applications in low cost products like telephones or in systems with energetic constraints like autonomous vehicles are faced with the demand for solutions with low complexity. A small vocabulary consisting of a few command words and the digits is suucient for most of the applications but has to be recognized robustly. Here we report about investiga...
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملOn improving discrimination capability of an RNN based recognizer
This paper presents a set of e ective and e cient techniques to improve the discrimination capability of a recurrent neural network (RNN) based isolated word recognizer. The recognizer contains a set of individually trained RNN speech models (RSMs). Each of them represents a di erent word in the vocabulary. Speech recognition is performed by selecting the RSM that best matches the input utteran...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996